ABSTRACT

In a randomized complete block design, we face the problem of
selecting the best population. If some partial information about
the unknown parameters is available, we wish to determine the
optimal decision rule for selecting the best population.

In this paper, within the class of natural selection rules, we
employ the $\Gamma$-optimal criterion to determine decision rules
that minimize the maximum expected risk over the class of prior
distributions consistent with the partial information. Furthermore,
traditional hypothesis testing is briefly discussed from the
viewpoint of ranking and selection.


1.  INTRODUCTION 


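In a randomized complete block design (R.C.B.D.) with $k$ treatments and $n$ blocks, the observations are modeled as

$$X_{i\ell} = \mu + \tau_i + \beta_\ell + \varepsilon_{i\ell}, \qquad \sum_{i=1}^{k} \tau_i = 0, \tag{1.1}$$

where $\mu$ is the overall mean, $\tau_i$ is the $i$-th treatment effect, $\beta_\ell$
is the $\ell$-th block effect, and $\varepsilon_{i\ell}$ is the error component of the $(i,\ell)$
cell. We assume that the errors within each block are jointly
normally distributed. We also assume that the quality of a treatment
is judged by the largeness of its $\tau_i$ value. The $i$-th population
is called the best if $\tau_i = \max_{1 \le j \le k} \tau_j$. In many practical
situations, the goal of the experimenter is to select the best population.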

In this paper, we shall use the $\Gamma$-optimal criterion to determine
the sample size of a natural selection procedure so that it
minimizes the maximum expected risk over the class of prior
distributions consistent with the partial information
[cf. Gupta and Huang (1976)].

In Section 2 some basic definitions and notation are introduced
and the basic formulation of the problem is given. In
Section 3, some useful expressions for the probability of correct
selection (PCS) are derived, and the $\Gamma$-optimal sample size is
determined. Section 4 gives a numerical example for illustration.
In Section 5, we discuss the relationship between $\Delta$ and
$n^*$. Section 6 includes some conclusions and a discussion of
traditional hypothesis testing from the viewpoint of ranking and
selection. For general references on multiple decision procedures,
see Gupta and Panchapakesan (1979) and Gupta and Huang (1981).


2.  BASIC  FORMULATION  OF  THE  SELECTION  PROBLEM 

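In the R.C.B.D. model (1.1), we assume that $\varepsilon_\ell = (\varepsilon_{1\ell}, \ldots, \varepsilon_{k\ell})'$,
the error components within the $\ell$-th block, jointly have a multivariate
normal distribution with mean vector $0 = (0, \ldots, 0)'$ and covariance matrix

$$\Sigma = \sigma^2 \begin{pmatrix} 1 & \lambda & \cdots & \lambda \\ \lambda & 1 & \cdots & \lambda \\ \vdots & & \ddots & \vdots \\ \lambda & \lambda & \cdots & 1 \end{pmatrix},$$

where $\sigma^2$ is unknown and $\lambda$ is a known constant. Thus $X_\ell = (X_{1\ell}, \ldots, X_{k\ell})'$
has a multivariate normal distribution with mean vector $\theta_\ell = (\theta_{1\ell}, \ldots, \theta_{k\ell})'$ and
covariance matrix $\Sigma$, where $\theta_{i\ell} = \mu + \tau_i + \beta_\ell$ for all $i$,
$1 \le i \le k$, and any $\ell$, $1 \le \ell \le n$. For all $i$, $1 \le i \le k$, define
$\bar{X}_i = \sum_{\ell=1}^{n} X_{i\ell}/n$. Then $Y_i = (\bar{X}_i - \bar{X}_1, \ldots, \bar{X}_i - \bar{X}_k)'$ is sufficient
for $(\tau_i - \tau_1, \ldots, \tau_i - \tau_k)'$. Now, if $\tau_i = \max_{1 \le j \le k} \tau_j$, then
$\tau_i - \tau_j \ge 0$ for all $1 \le j \le k$. We consider a class of natural
selection rules for the $i$-th population, $1 \le i \le k$, as:

$$\delta^{(i)}(\bar{x}_n) = \begin{cases} 1 & \text{if } y_i \ge 0, \\ 0 & \text{otherwise,} \end{cases} \tag{2.1}$$

where $\bar{x}_n = (\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_k)'$ and $y_i \ge 0$ is understood componentwise,
that is, $\bar{x}_i \ge \bar{x}_j$ for all $j$. Some optimal properties of these rules have been
studied by several authors (see Gupta and Panchapakesan (1979)).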
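The model and the natural selection rule can be illustrated with a small simulation. This is only a sketch under stated assumptions: the function names `simulate_rcbd` and `natural_selection` are hypothetical, the block effects are drawn at random purely for illustration (the paper treats them as fixed), and the equicorrelated errors are generated with a shared-plus-own-noise construction that requires $0 \le \lambda < 1$.

```python
import random
from statistics import fmean

def simulate_rcbd(tau, n, mu=0.0, sigma=1.0, lam=0.5, block_sd=1.0, rng=None):
    """Simulate X[i][l] = mu + tau[i] + beta[l] + eps[i][l] for k treatments, n blocks.

    Errors within a block have variance sigma^2 and common correlation lam
    (assumes 0 <= lam < 1). Block effects are random here only for illustration.
    """
    rng = rng or random.Random()
    k = len(tau)
    data = [[0.0] * n for _ in range(k)]
    for l in range(n):
        beta = rng.gauss(0.0, block_sd)      # illustrative block effect
        shared = rng.gauss(0.0, 1.0)         # component common to the whole block
        for i in range(k):
            own = rng.gauss(0.0, 1.0)        # treatment-specific component
            eps = sigma * (lam ** 0.5 * shared + (1.0 - lam) ** 0.5 * own)
            data[i][l] = mu + tau[i] + beta + eps
    return data

def natural_selection(data):
    """Return the 0/1 vector (delta^(1), ..., delta^(k)): select the largest treatment mean."""
    means = [fmean(row) for row in data]
    best = max(range(len(means)), key=means.__getitem__)
    return [1 if i == best else 0 for i in range(len(means))]

if __name__ == "__main__":
    sample = simulate_rcbd([0.0, 0.0, 5.0], n=20, rng=random.Random(0))
    print(natural_selection(sample))
```

Because the block effect is shared by all treatments within a block, it cancels in every comparison of treatment means, which is why the rule depends only on the differences $\bar{x}_i - \bar{x}_j$.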

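So the class of natural selection rules can be denoted by:

$$D = \{\delta(\bar{x}_n) \mid \delta(\bar{x}_n) = (\delta^{(1)}(\bar{x}_n), \ldots, \delta^{(k)}(\bar{x}_n))'\}. \tag{2.2}$$

The parameter space $\Omega$ is as follows:

$$\Omega = \{\tau = (\tau_1, \ldots, \tau_k)' \mid \tau_i \in R \text{ for all } i = 1, \ldots, k\}. \tag{2.3}$$

Let $\Delta$ be a given positive constant, and for all $i$, $1 \le i \le k$, let

$$\Omega_i = \{\tau = (\tau_1, \ldots, \tau_k)' \mid \tau_i \ge \tau_j + \Delta\sigma \text{ for all } j \ne i\}, \tag{2.4}$$

$$\Omega_0 = \{\tau \mid \tau_1 = \cdots = \tau_k\}, \tag{2.5}$$

and

$$\Omega_{k+1} = \Omega - \Big(\bigcup_{i=0}^{k} \Omega_i\Big).$$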

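Let $L^{(i)}(\tau; \delta^{(j)}(\bar{x}_n))$ represent the loss function for $\tau \in \Omega_i$,
$0 \le i \le k+1$, when the $j$-th population, $1 \le j \le k$, is selected.
For $1 \le j \le k$, let

$$L^{(i)}(\tau; \delta^{(j)}(\bar{x}_n)) = \begin{cases} (c_0 n)\,\delta^{(j)}(\bar{x}_n) & \text{for } \tau \in \Omega_0 \ (i = 0), \\ \ell\!\left(\dfrac{\tau_i - \tau_j}{\sigma}\right)\delta^{(j)}(\bar{x}_n) & \text{for } \tau \in \Omega_i \ (1 \le i \le k), \\ 0 & \text{for } \tau \in \Omega_{k+1} \ (i = k+1), \end{cases} \tag{2.6}$$

where $\ell$ is some positive increasing function such that $\ell(0) = 0$
and $\ell(x) = o(e^{cx^2})$, $c > 0$, and $c_0$ represents the sampling cost
from each population ($c_0 > 0$). So, for all $\tau \in \Omega_i$, $0 \le i \le k+1$,
the loss function of $\delta(\bar{x}_n)$ is defined as:

$$L^{(i)}(\tau; \delta(\bar{x}_n)) = \sum_{j=1}^{k} L^{(i)}(\tau; \delta^{(j)}(\bar{x}_n)). \tag{2.7}$$

Similarly, the risk function is

$$R^{(i)}(\tau; \delta_n) = E\{L^{(i)}(\tau; \delta(\bar{x}_n))\}, \tag{2.8}$$

and for some prior distribution $\rho$ over $\Omega$, $\gamma^{(i)}(\rho; \delta_n)$ is
defined as:

$$\gamma^{(i)}(\rho; \delta_n) = E\{R^{(i)}(\tau; \delta_n)\} = \int_{\Omega_i} R^{(i)}(\tau; \delta_n)\, d\rho(\tau). \tag{2.9}$$

Thus, the Bayes risk of $\delta_n$ w.r.t. $\rho$ is defined as

$$\gamma(\rho; \delta_n) = \sum_{i=0}^{k+1} \gamma^{(i)}(\rho; \delta_n). \tag{2.10}$$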
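A minimal sketch of the loss in (2.6) and (2.7) may help fix the indexing. The function name `loss` and its argument layout are assumptions made for this illustration; `region` plays the role of the index $i$ of $\Omega_i$, and `delta` is the 0/1 decision vector.

```python
def loss(region, tau, delta, n, c0, ell, sigma=1.0):
    """Loss L^(i)(tau; delta) summed over j as in (2.7).

    region: index i of Omega_i, from 0 to k+1 (assumed to be known to the caller).
    delta:  0/1 list of length k; delta[j] == 1 means population j+1 is selected.
    ell:    positive increasing function with ell(0) == 0.
    """
    k = len(tau)
    if region == 0:
        # All treatment effects equal: only the sampling cost c0 * n is charged.
        return sum(c0 * n * d for d in delta)
    if 1 <= region <= k:
        i = region - 1
        # Loss grows with the standardized gap between the best and the selected one.
        return sum(ell((tau[i] - tau[j]) / sigma) * d for j, d in enumerate(delta))
    # region == k + 1: no loss is charged.
    return 0.0
```

For example, with `ell = lambda x: 15 * x`, selecting the best population in $\Omega_1$ costs nothing, while selecting a population whose effect is one standard deviation smaller costs 15.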


In this selection problem, it is assumed that some partial
information is available, so that we can specify $\pi_i = \Pr(\tau \in \Omega_i)$
for all $i$, $0 \le i \le k+1$, and define

$$\Gamma = \Big\{\rho \;\Big|\; \int_{\Omega_i} d\rho(\tau) = \pi_i, \ \sum_{i=0}^{k+1} \pi_i = 1, \ 0 \le i \le k+1\Big\}. \tag{2.11}$$

If there is no prior information, we can assume that

$$\pi_0 = \cdots = \pi_k = (1 - \pi_{k+1})/(k+1).$$

Now if there exists $n^*$ such that

$$\sup_{\rho \in \Gamma} \gamma(\rho; \delta_{n^*}) = \inf_{\delta_n \in D} \Big\{\sup_{\rho \in \Gamma} \gamma(\rho; \delta_n)\Big\}, \tag{2.12}$$

then $\delta_{n^*}$ is called a $\Gamma$-optimal decision rule and $n^*$ is the
$\Gamma$-optimal decision. In the following discussion, we will determine
$n^*$ for this selection problem.


3.  MAIN  RESULTS 


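We can easily show the following lemma.

Lemma 3.1. Suppose that for any $\ell$, $1 \le \ell \le n$, $X_\ell = (X_{1\ell}, \ldots, X_{k\ell})'$
follows a multivariate normal distribution with mean vector
$\theta_\ell = (\mu + \tau_1 + \beta_\ell, \ldots, \mu + \tau_k + \beta_\ell)'$ and covariance matrix $\Sigma$.
Let $Y_i = (\bar{X}_i - \bar{X}_1, \ldots, \bar{X}_i - \bar{X}_k)'$ and $p_\tau(i) = P_\tau(\bar{X}_i \ge \max_{j \ne i} \bar{X}_j)$, where
$\bar{X}_i = \sum_{\ell=1}^{n} X_{i\ell}/n$ for any $i$ ($i = 1, \ldots, k$).

Then

a) $Y_i$ follows a multivariate normal distribution with
mean vector $(\tau_i - \tau_1, \ldots, \tau_i - \tau_k)'$ and covariance matrix whose
diagonal entries are $2\sigma^2(1-\lambda)/n$ and whose off-diagonal entries are $\sigma^2(1-\lambda)/n$;

c) $$p_\tau(i) = \int_{-\infty}^{\infty} \prod_{j \ne i} \Phi\!\left(z + \frac{\tau_i - \tau_j}{\sqrt{\sigma^2(1-\lambda)/n}}\right) d\Phi(z), \tag{3.3}$$

where $\Phi_{k-1}(\cdot)$ denotes the c.d.f. of the $(k-1)$-variate normal
distribution with mean vector $0 = (0, \ldots, 0)'$ and covariance matrix
with unit diagonal entries and off-diagonal entries $1/2$,
and $\Phi(\cdot)$ denotes the c.d.f. of the standard normal distribution.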
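The integral representation of the selection probability can be evaluated numerically. The sketch below uses a plain trapezoidal rule and only the standard library; it assumes the equicorrelated error structure of Section 2, under which $P(\bar{X}_i \ge \max_{j \ne i} \bar{X}_j) = \int \prod_{j \ne i} \Phi\big(z + (\tau_i - \tau_j)/\sqrt{\sigma^2(1-\lambda)/n}\big)\, d\Phi(z)$. The function names and the quadrature settings are hypothetical choices for illustration.

```python
import math

def std_normal_cdf(x):
    """Standard normal c.d.f. via the complementary error function."""
    return 0.5 * math.erfc(-x / math.sqrt(2.0))

def std_normal_pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def prob_select(i, tau, n, sigma=1.0, lam=0.0, steps=4000, zmax=8.0):
    """Probability that treatment i (0-based) has the largest mean over n blocks.

    Trapezoidal rule on [-zmax, zmax]; assumes common correlation lam < 1.
    """
    scale = math.sqrt(sigma ** 2 * (1.0 - lam) / n)
    h = 2.0 * zmax / steps
    total = 0.0
    for s in range(steps + 1):
        z = -zmax + s * h
        prod = 1.0
        for j, tau_j in enumerate(tau):
            if j != i:
                prod *= std_normal_cdf(z + (tau[i] - tau_j) / scale)
        weight = 0.5 if s in (0, steps) else 1.0   # trapezoid end-point weights
        total += weight * prod * std_normal_pdf(z)
    return total * h
```

Two sanity checks follow from the definition: when all treatment effects are equal each population is selected with probability $1/k$, and the selection probabilities of the $k$ populations sum to one.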


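Theorem 3.1. In R.C.B.D., for fixed $\Delta$ and $\lambda$, let

$$Q_M(n) = \sup_{g \ge \Delta} \left\{ \ell(g)\left(1 - \Phi_{k-1}\!\left(\frac{g}{\sqrt{2(1-\lambda)/n}}, \ldots, \frac{g}{\sqrt{2(1-\lambda)/n}}\right)\right)\right\} \tag{3.4}$$

and $H(n) = \sup_{\rho \in \Gamma} \gamma(\rho; \delta_n)$. Then there exists $n^*$ such that

$$H(n^*) = \inf_{n \ge 1} H(n),$$

where

$$n^* = \begin{cases} \langle n_0^* \rangle & \text{if } H(\langle n_0^* \rangle) \le H([n_0^*]), \\ [n_0^*] & \text{if } H(\langle n_0^* \rangle) > H([n_0^*]), \end{cases}$$

and $n_0^*$ is a positive real number, assumed to exist, satisfying the following conditions:

a) $Q_M'(n_0^*) = -c_0\pi_0/(1 - \pi_0 - \pi_{k+1})$,

b) $Q_M''(n_0^*) > 0$. (3.5)

Here $\langle x \rangle$ ($[x]$) denotes the smallest (largest) integer which is larger
(less) than or equal to $x$.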


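Proof. For any $\delta_n \in D$, we have

$$\gamma(\rho; \delta_n) = \sum_{i=0}^{k+1} \int_{\Omega_i} R^{(i)}(\tau; \delta_n)\, d\rho(\tau) = (c_0 n)\pi_0 + \sum_{i=1}^{k} \int_{\Omega_i} \sum_{j=1}^{k} \ell\!\left(\frac{\tau_i - \tau_j}{\sigma}\right) p_\tau(j)\, d\rho(\tau).$$

We use the following two facts.

1) $\sup_{\tau \in \Omega_0} p_\tau(i) = \int \Phi^{k-1}(z)\, d\Phi(z) = 1/k$.

2) By Somerville's paper (1954), for $1 \le i \le k$, we have

$$\sup_{\tau \in \Omega_i} \Big\{ \sum_{j=1}^{k} \ell\!\left(\frac{\tau_i - \tau_j}{\sigma}\right) p_\tau(j) \Big\} = \sup_{g_j \ge \Delta} \Big\{ \sum_{j \ne i} \ell(g_j)\, p_j \Big\} = \sup_{g \ge \Delta} \left\{ \ell(g)\left(1 - \Phi_{k-1}\!\left(\frac{g}{\sqrt{2(1-\lambda)/n}}, \ldots, \frac{g}{\sqrt{2(1-\lambda)/n}}\right)\right)\right\} = Q_M(n).$$

Thus, $H(n) = \sup_{\rho \in \Gamma} \gamma(\rho; \delta_n) = n c_0 \pi_0 + Q_M(n)(1 - \pi_0 - \pi_{k+1})$.

Since there exists $n_0^*$ such that

a) $Q_M'(n_0^*) = -(c_0\pi_0)/(1 - \pi_0 - \pi_{k+1})$,

b) $Q_M''(n_0^*) > 0$,

we have

$$H'(n_0^*) = 0 \quad \text{and} \quad H''(n_0^*) > 0.$$

So

$$n^* = \begin{cases} \langle n_0^* \rangle & \text{if } H(\langle n_0^* \rangle) \le H([n_0^*]), \\ [n_0^*] & \text{if } H(\langle n_0^* \rangle) > H([n_0^*]). \end{cases}$$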
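The link between the conditions on $Q_M$ and the stationarity of $H$ can be written out in one step. This is a worked restatement rather than new material: it treats $n$ as a continuous variable and assumes $Q_M$ is twice differentiable and $1 - \pi_0 - \pi_{k+1} > 0$.

```latex
\begin{aligned}
H(n)   &= n\,c_0\pi_0 + (1-\pi_0-\pi_{k+1})\,Q_M(n), \\
H'(n)  &= c_0\pi_0 + (1-\pi_0-\pi_{k+1})\,Q_M'(n), \\
H''(n) &= (1-\pi_0-\pi_{k+1})\,Q_M''(n).
\end{aligned}
```

Setting $H'(n) = 0$ gives $Q_M'(n) = -c_0\pi_0/(1-\pi_0-\pi_{k+1})$, and $Q_M''(n) > 0$ makes $H''(n) > 0$, so the stationary point is a local minimum of $H$. Rounding to the better of the two neighbouring integers then gives the integer sample size.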


Lemma 3.2. (Slepian (1962)) Let $(X_1, \ldots, X_n)$ be multivariate
normal with zero mean and positive definite covariance matrix
$\{\rho_{ij}\}$, and let $(Y_1, \ldots, Y_n)$ be multivariate normal with zero mean
and positive definite covariance matrix $\{\kappa_{ij}\}$. Let $\rho_{ij} \ge \kappa_{ij}$
for $i, j = 1, \ldots, n$ and $\rho_{ii} = \kappa_{ii}$, $i = 1, \ldots, n$. Then

$$\Pr(X_1 \ge a_1, \ldots, X_n \ge a_n) \ge \Pr(Y_1 \ge a_1, \ldots, Y_n \ge a_n).$$

In general, the value of $n$ attaining the infimum of $H(n)$ must be
found numerically.

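Theorem 3.2. Let $\ell(\cdot)$ be a positive increasing function
such that $\ell(x) = o(e^{cx^2})$ ($c > 0$),

$$Q_M(n) = \sup_{g \ge \Delta} \left\{ \ell(g)\left(1 - \Phi_{k-1}\!\left(\frac{g}{\sqrt{2(1-\lambda)/n}}, \ldots, \frac{g}{\sqrt{2(1-\lambda)/n}}\right)\right)\right\}$$

and $H(n) = n c_0 \pi_0 + Q_M(n)(1 - \pi_0 - \pi_{k+1})$. Let

$$n_0 = \left\langle (k-1)^2\, \frac{1-\lambda}{\pi} \left(\frac{1 - \pi_0 - \pi_{k+1}}{c_0 \pi_0}\right)^{2} \frac{\ell(g^*)^2}{g^{*2}\, e^{g^{*2}/(2(1-\lambda))}} \right\rangle, \tag{3.6}$$

where $g^*$ is such that

$$\frac{\ell(g^*)}{g^*}\, e^{-g^{*2}/(4(1-\lambda))} = \sup_{g \ge \Delta} \left\{ \frac{\ell(g)}{g}\, e^{-g^2/(4(1-\lambda))} \right\}.$$

Then

$$\inf_{n \ge 1} H(n) = \inf_{n \le n_0} H(n).$$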

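Proof. By Lemma 3.2, we have

$$0 < 1 - \Phi_{k-1}(t, \ldots, t) \le (k-1)(1 - \Phi(t))$$

and

$$1 - \Phi(t) < \frac{1}{t\sqrt{2\pi}}\, e^{-t^2/2} \quad \text{for } t > 0.$$

Thus

$$Q_M(n) \le \sup_{g \ge \Delta} \ell(g)(k-1)\left(1 - \Phi\!\left(\frac{g}{\sqrt{2(1-\lambda)/n}}\right)\right) \le (k-1)\sqrt{\frac{1-\lambda}{\pi n}}\, \sup_{g \ge \Delta} \left\{ \frac{\ell(g)}{g}\, e^{-n g^2/(4(1-\lambda))} \right\}. \tag{3.7}$$

Since $\ell(g)$ is a positive increasing function such that $\ell(g) = o(e^{cg^2})$
($c > 0$), there exists $g^*$ such that

$$\sup_{g \ge \Delta} \left\{ \frac{\ell(g)}{g}\, e^{-g^2/(4(1-\lambda))} \right\} = \frac{\ell(g^*)}{g^*}\, e^{-g^{*2}/(4(1-\lambda))}. \tag{3.8}$$

By (3.7) and (3.8), as $n \to \infty$, $Q_M(n)$ decreases to 0. We can find

$$n_0 = \left\langle (k-1)^2\, \frac{1-\lambda}{\pi} \left(\frac{1 - \pi_0 - \pi_{k+1}}{c_0 \pi_0}\right)^{2} \frac{\ell(g^*)^2}{g^{*2}\, e^{g^{*2}/(2(1-\lambda))}} \right\rangle$$

to satisfy

$$Q_M(n_0) \le \frac{c_0 \pi_0}{1 - \pi_0 - \pi_{k+1}}.$$

Now for any $n \ge n_0$, $Q_M(n) - Q_M(n+1) \le Q_M(n_0) \le c_0\pi_0/(1 - \pi_0 - \pi_{k+1})$, so that
$H(n+1) - H(n) = c_0\pi_0 - (1 - \pi_0 - \pi_{k+1})(Q_M(n) - Q_M(n+1)) \ge 0$. In other
words, $H(n)$ is an increasing function of $n$ for $n \ge n_0$. Thus,

$$\inf_{n \ge 1} H(n) = \inf_{n \le n_0} H(n).$$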
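The normal tail bound $1 - \Phi(t) < e^{-t^2/2}/(t\sqrt{2\pi})$ for $t > 0$, used in the proof, is easy to check numerically. The snippet below is only a sanity check with hypothetical helper names; it does not replace the analytic argument.

```python
import math

def upper_tail(t):
    """1 - Phi(t) for the standard normal distribution."""
    return 0.5 * math.erfc(t / math.sqrt(2.0))

def mills_bound(t):
    """Upper bound exp(-t^2/2) / (t * sqrt(2*pi)), valid for t > 0."""
    return math.exp(-0.5 * t * t) / (t * math.sqrt(2.0 * math.pi))

if __name__ == "__main__":
    for t in (0.1, 0.5, 1.0, 2.0, 4.0):
        print(t, upper_tail(t), mills_bound(t))
```

The bound is loose for small $t$ and becomes sharp as $t$ grows, which is the regime that matters when $n$ is large.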

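Since $n$ now ranges over a finite domain, we can solve for
$n^*$ numerically by the following algorithm.

1. Determine $n_0$ such that (3.6) holds.

2. Determine a non-empty set $C$, where

$$C = \left\{ n' \le n_0 \;\middle|\; Q_M(n') \le Q_M(n'-1) - \frac{c_0\pi_0}{1 - \pi_0 - \pi_{k+1}}, \quad Q_M(n') \le Q_M(n'+1) + \frac{c_0\pi_0}{1 - \pi_0 - \pi_{k+1}} \right\}.$$

These two conditions are equivalent to $H(n') \le H(n'-1)$ and $H(n') \le H(n'+1)$.

3. If $C$ is a singleton consisting of $n'$, then $n^* = n'$; if $C$
has cardinality $\ge 2$, then choose $n^*$ such that

$$H(n^*) = \inf_{n \in C} H(n).$$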
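The three steps can be sketched in code for the simplest case $k = 2$, where $\Phi_{k-1}$ is the univariate standard normal c.d.f. and no multivariate integration is needed. Everything here is an illustrative assumption rather than the authors' implementation: the names `q_m` and `gamma_optimal_n` are hypothetical, the supremum over $g \ge \Delta$ is approximated on a grid, $Q_M(0)$ is taken to be $+\infty$ so that $n' = 1$ is admissible, and the upper limit `n0` is passed in by the caller.

```python
import math

def upper_tail(t):
    """1 - Phi(t) for the standard normal distribution."""
    return 0.5 * math.erfc(t / math.sqrt(2.0))

def q_m(n, ell, delta, lam, g_max=10.0, step=0.005):
    """Grid approximation of Q_M(n) for k = 2 (hypothetical helper)."""
    scale = math.sqrt(2.0 * (1.0 - lam) / n)
    best = 0.0
    g = delta
    while g <= g_max:
        best = max(best, ell(g) * upper_tail(g / scale))
        g += step
    return best

def gamma_optimal_n(n0, ell, delta, lam, c0, pi0, pi_last=0.0):
    """Steps 2 and 3: build the candidate set C and minimize H over it."""
    w = 1.0 - pi0 - pi_last
    cost = c0 * pi0 / w
    q = {n: q_m(n, ell, delta, lam) for n in range(1, n0 + 2)}
    q[0] = math.inf                      # convention so that n' = 1 can qualify

    def H(n):
        return n * c0 * pi0 + w * q[n]

    # Local-minimum conditions of step 2, written in terms of Q_M.
    C = [n for n in range(1, n0 + 1)
         if q[n] <= q[n - 1] - cost and q[n] <= q[n + 1] + cost]
    return min(C, key=H), C              # raises ValueError if C is empty

if __name__ == "__main__":
    n_star, candidates = gamma_optimal_n(
        60, lambda g: 15.0 * g, delta=0.1, lam=0.5, c0=1.0, pi0=0.10)
    print(n_star, candidates)
```

Because the conditions defining $C$ are just the local-minimum conditions for $H$, the value returned coincides with a brute-force minimization of $H$ over $1, \ldots, n_0$ whenever the minimizer lies strictly below $n_0$.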

An  example  is  considered  in  Section  4. 

4.  Numerical  example  for  the  existence  of  n* 


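We consider a special case of the loss function, namely,
$\ell(g) = c' g^{\alpha}$, $c' > 0$, $g \ge \Delta$, $\alpha \ge 1$. Then

$$\frac{\ell(g)}{g}\, e^{-g^2/(4(1-\lambda))} = c' g^{\alpha-1} e^{-g^2/(4(1-\lambda))}$$

has its maximum point at $g^* = \max\big(\Delta, \sqrt{2(1-\lambda)(\alpha-1)}\big)$. Thus $n_0$ can be expressed as

$$n_0 = \left\langle (k-1)^2\, \frac{1-\lambda}{\pi} \left(\frac{c'(1 - \pi_0 - \pi_{k+1})}{c_0 \pi_0}\right)^{2} g^{*\,2(\alpha-1)}\, e^{-g^{*2}/(2(1-\lambda))} \right\rangle.$$

Let

$$M_{k-1}^{\alpha}(\lambda) = \sup_{t} \{ t^{\alpha}(1 - \Phi_{k-1}(t, \ldots, t)) \};$$

then

$$Q_M(n) = c' \left(\frac{2(1-\lambda)}{n}\right)^{\alpha/2} M_{k-1}^{\alpha}(\lambda).$$

Now, $C$ is the set of all $n' \le n_0$ such that

$$\text{(1)} \quad \left[\left(\frac{1}{n'}\right)^{\alpha/2} - \left(\frac{1}{n'+1}\right)^{\alpha/2}\right] M_{k-1}^{\alpha}(\lambda) \le \frac{c_0\pi_0/(1 - \pi_0 - \pi_{k+1})}{c'(2(1-\lambda))^{\alpha/2}},$$

$$\text{(2)} \quad \left[\left(\frac{1}{n'-1}\right)^{\alpha/2} - \left(\frac{1}{n'}\right)^{\alpha/2}\right] M_{k-1}^{\alpha}(\lambda) \ge \frac{c_0\pi_0/(1 - \pi_0 - \pi_{k+1})}{c'(2(1-\lambda))^{\alpha/2}}.$$

By using Table 3.1 of Somerville's paper (1954), we can compute
$H(n^*) = \inf_{n \in C} H(n)$ directly. Some $\Gamma$-optimal sample sizes are given
in Tables I and II for $\lambda = 0.0, 0.5$, $\pi_0 = 0.05, 0.10, 0.15$,
$\alpha = 1.0, 2.0$, $\Delta < 0.05$, $\pi_{k+1} = 0.0$ and $c'/c_0 = 15, 30, 45, 60$.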
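For the power loss $\ell(g) = c' g^{\alpha}$, the maximizer $g^*$ and the search bound $n_0$ have closed forms, so they are simple to compute. The sketch below follows the expressions $g^* = \max(\Delta, \sqrt{2(1-\lambda)(\alpha-1)})$ and $n_0 = \langle (k-1)^2 \frac{1-\lambda}{\pi} (c'(1-\pi_0-\pi_{k+1})/(c_0\pi_0))^2 g^{*2(\alpha-1)} e^{-g^{*2}/(2(1-\lambda))} \rangle$ as read from the damaged source, so the exact constants should be treated as an assumption; the function names and the argument `c_ratio` (standing for $c'/c_0$) are hypothetical.

```python
import math

def g_star(delta, lam, alpha):
    """Maximizer of g^(alpha-1) * exp(-g^2 / (4(1-lam))) over g >= delta (alpha >= 1)."""
    return max(delta, math.sqrt(2.0 * (1.0 - lam) * (alpha - 1.0)))

def n_zero(k, lam, alpha, delta, c_ratio, pi0, pi_last=0.0):
    """Upper search bound n_0 for the power loss; c_ratio stands for c'/c0."""
    g = g_star(delta, lam, alpha)
    w = 1.0 - pi0 - pi_last
    value = ((k - 1) ** 2 * (1.0 - lam) / math.pi
             * (c_ratio * w / pi0) ** 2
             * g ** (2.0 * (alpha - 1.0))
             * math.exp(-g * g / (2.0 * (1.0 - lam))))
    return math.ceil(value)              # <x>: smallest integer >= x

if __name__ == "__main__":
    print(g_star(0.05, 0.5, 2.0), n_zero(4, 0.5, 1.0, 0.05, 15.0, 0.10))
```

The bound $n_0$ is typically far larger than the optimal $n^*$; its role is only to make the search range finite.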


5.  Sensitivity  analysis  between  $\Delta$  and  $n^*$

In this section, we discuss some relationships between $\Delta$ and
$n^*$. Since the relationship between $\Delta$ and $n^*$ depends on
$\lambda$, $\alpha$, $k$, $c'/c_0$, $\pi_0$ and $\pi_{k+1}$, we fix
$\lambda = 0.5$, $\alpha = 1.0$, $k = 4$ and $\pi_{k+1} = 0$. Let $c'/c_0$ change from 15
to 30 and $\pi_0$ change from 0.10 to 0.15. With different values
of $c'/c_0$ and $\pi_0$, we get a clear idea of the relationship between
$\Delta$ and $n^*$. The results are shown in Table III and Fig. 1. We
observe that the relation in Fig. 1.b is more stable than in
Fig. 1.a and the relation in Fig. 1.d is more stable than in
Fig. 1.c. Thus for fixed $c'/c_0$, a larger $\pi_0$ corresponds to
a more stable relationship between $\Delta$ and $n^*$. Similarly, Fig. 1.a
is more stable than Fig. 1.c and Fig. 1.b is more stable than
Fig. 1.d; this means that for fixed $\pi_0$, a smaller $c'/c_0$
corresponds to a more stable relationship between $\Delta$ and $n^*$.


6.  Discussion 

In the special case of $k = 2$, we have

$$\Omega = \{\tau = (\tau_1, \tau_2)' \mid \tau_i \in R, \ i = 1, 2\},$$

$$\Omega_0 = \{\tau \mid \tau_1 = \tau_2\},$$

$$\Omega_1 = \{\tau \mid \tau_1 \ge \tau_2 + \Delta\sigma\},$$

$$\Omega_2 = \{\tau \mid \tau_2 \ge \tau_1 + \Delta\sigma\},$$

and

$$\Omega_3 = \Omega - \Big(\bigcup_{i=0}^{2} \Omega_i\Big).$$

If we have no prior information about the parameters, we
can take $\Pr(\tau \in \Omega_0) = \Pr(\tau \in \Omega_1) = 1/2$. Then the problem reduces to
the traditional problem of testing

(*) $H_0: \tau_1 = \tau_2$ vs $H_1: \tau_1 \ge \tau_2 + \Delta\sigma$.

It should be pointed out that both the type I and type II errors
are controlled simultaneously.
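For $k = 2$ the probability of an incorrect selection at the boundary of $\Omega_1$ has a closed form, which gives a feel for how the sample size controls the error under the alternative. The sketch assumes the equicorrelated error structure of Section 2, so that $\bar{X}_1 - \bar{X}_2$ has variance $2\sigma^2(1-\lambda)/n$; the function name is hypothetical and the interpretation as a type II error rate is an informal reading, not a statement from the paper.

```python
import math

def wrong_selection_prob(n, delta, lam):
    """P(mean of population 2 exceeds mean of population 1) when tau_1 - tau_2 = delta * sigma.

    Equals Phi(-delta * sqrt(n / (2 * (1 - lam)))) under the assumed covariance.
    """
    t = delta * math.sqrt(n / (2.0 * (1.0 - lam)))
    return 0.5 * math.erfc(t / math.sqrt(2.0))

if __name__ == "__main__":
    for n in (5, 10, 20, 40):
        print(n, wrong_selection_prob(n, delta=0.4, lam=0.5))
```

The probability equals one half when $\Delta = 0$ and decreases as either $n$ or $\Delta$ grows, while the sampling cost $c_0 n$ grows with $n$; the $\Gamma$-optimal $n^*$ balances the two.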

TABLE III.

Relationship between $\Delta$ and $n^*$

                 c'/c_0 = 15               c'/c_0 = 30
 $\Delta$    π_0 = 0.10   π_0 = 0.15   π_0 = 0.10   π_0 = 0.15
   0.35          11            8           19           13
   0.40          12            9           18           13